Translational Symmetry in Subsequence Time-Series Clustering

Author

  • Tsuyoshi Idé
Abstract

We treat the problem of subsequence time-series clustering (STSC) from a group-theoretical perspective. First, we show that the sliding window technique introduces a mathematical artifact into the problem, which we call pseudo-translational symmetry. Second, we show that the resulting cluster centers are necessarily governed by irreducible representations of the translational group. As a result, the cluster centers necessarily form sinusoids, almost irrespective of the input time-series data. To the best of the author’s knowledge, this is the first work that demonstrates the interesting connection between STSC and group theory.
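The symmetry described in the abstract can be illustrated with a small numerical sketch. This is not the author's derivation. It assumes NumPy, a window stride of one, and a periodic (wrap-around) boundary, chosen here so that the symmetry holds exactly rather than approximately. The function name `sliding_windows`, the window length, and the random-walk input are illustrative choices.

```python
import numpy as np

def sliding_windows(x, w, circular=True):
    """Stack every length-w subsequence of x as a row (stride 1)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    if circular:
        # Assumption: wrap around at the end so every shift is represented.
        idx = (np.arange(n)[:, None] + np.arange(w)[None, :]) % n
    else:
        # Ordinary sliding window: n - w + 1 subsequences.
        idx = np.arange(n - w + 1)[:, None] + np.arange(w)[None, :]
    return x[idx]

rng = np.random.default_rng(0)
x = rng.standard_normal(200).cumsum()  # random walk; any input would do
w = 16
X = sliding_windows(x, w)              # one subsequence per row
G = X.T @ X                            # (w, w) second-moment matrix

# Shifting both positions inside the window by one leaves the entry
# unchanged, so G[j, k] depends only on the lag k - j (Toeplitz structure).
shift_invariant = np.allclose(G[:-1, :-1], G[1:, 1:])
print(shift_invariant)
```

The shift invariance comes from the windowing step alone, not from the data. Matrices with this lag-only structure are known to have approximately sinusoidal eigenvectors for large windows, which is consistent with the abstract's claim that sinusoidal centers arise almost irrespective of the input. With the ordinary (non-circular) window the invariance holds only approximately, hence "pseudo" symmetry.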

Similar resources

Why Does Subsequence Time-Series Clustering Produce Sine Waves?

Data mining and machine learning communities were surprised when Keogh et al. (2003) pointed out that the k-means cluster centers in subsequence time-series clustering become sinusoidal pseudopatterns for almost all kinds of input time-series data. Understanding this mechanism is an important open problem in data mining. Our new theoretical approach (based on spectral clustering and translationa...

A Review of Subsequence Time Series Clustering

Clustering of subsequence time series remains an open issue in time series clustering. Subsequence time series clustering is used in different fields, such as e-commerce, outlier detection, speech recognition, biological systems, DNA recognition, and text mining. One of the useful fields in the domain of subsequence time series clustering is pattern recognition. To improve this field, a sequenc...

Theoretical Analysis of Subsequence Time-Series Clustering from a Frequency-Analysis Viewpoint

Although Subsequence Time Series (STS) clustering is one of the most popular techniques for pattern discovery from time-series data, a mathematical methodology for analyzing STS clustering (or pattern discovery from time-series data) has attracted little attention. In this situation, there has been a surprising report [10] that cluster centers obtained using STS clustering closely resemble "sine waves" w...

Useful Clustering Outcomes from Meaningful Time Series Clustering

Clustering time series data using the popular subsequence (STS) technique has been widely used in the data mining and wider communities. Recently the conclusion was made that it is meaningless, based on the findings that it produces (a) clustering outcomes for distinct time series that are not distinguishable from one another, and (b) cluster centroids that are smoothed. More recent work has si...

Selective Subsequence Time Series clustering

Elsevier B.V., 2012. http://dx.doi.org/10.1016/j.knosys.2012.04.022. Subsequence Time Series (STS) Clustering is a time series mining task used to discover clusters of interesting subsequences in time series data...

Publication date: 2006